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Abstract 

The paper establishes a functional version of the Hoeffding combi- 
natorial central limit theorem. First, a pre-limiting Gaussian process 
approximation is defined, and is shown to be at a distance of the order 
of the Lyapounov ratio from the original random process. Distance is 
measured by comparison of expectations of smooth functionals of the 
processes, and the argument is by way of Stein's method. The pre- 
limiting process is then shown, under weak conditions, to converge to 
a Gaussian limit process. The theorem is used to describe the shape 
of random permutation tableaux. 
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1 Introduction 

Let := (a^ 1 < i,j < n), n > 1, be a sequence of real matri- 

ces. Hoeffding's (1951) combinatorial central limit theorem asserts that if it 
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is a uniform random permutation of {1,2, ... ,n}, then, under appropriate 
conditions, the distribution of the sum 



n 

4 B) ■■= E fl S n) (^«). 



i=i 

when centred and normalized, converges to the standard normal distribution. 
The centring is usually accomplished by replacing a^\i,j) with 

aM(*,j) := a^(i,j)-a^(+,j)-a^(i,+) + a^\+,+), 

where 

n n 

i=i i=i 

n 

i=i 

This gives = sf - ES^ n) , and the variance VarS^ = VarSf is then 
given by 

n 

{5™ (a)} 2 := (n-l)- 1 ^^)} 2 . 

Bolthausen (1984) proved the analogous Berry-Esseen theorem: that, for 
any n x n matrix a, 

sup |P[S - m(a) < xs(a)} - $(x)\ < CA(a), 
for a universal constant C, where 

n n 

So := y^ao(i,7r(z)), m(a) := a (i, j) = ESb, 

i=l ij=l 

§ 2 (a) := (n- l)- 1 £ a 2 (*,j) = VarS, (1.1) 
(we tacitly assume n > 2 when necessary) and 



1 n 
A(a) := r V 



a(«,j)| 3 



is the analogue of the Lyapounov ratio. 
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In this paper, we begin by proving a functional version of Bolthausen's 
theorem, again with an error expressed in terms of a Lyapounov ratio. When 
centring the functional version So(t) := Yl\=i a o(z, 7r(i)), < £ < 1, it is 
however no longer natural to make the double standardization that is used 
to derive a from ao- Instead, we shall at each step centre the random variables 
ao(i, 7r(z)) individually by their means ao(i, +). Equivalently, in what follows, 
we shall work with matrices a satisfying d(i, +) = for all i, but with no 
assumption as to the value of a(+, j). For example, if we have ao(i, j) = b(i) + 
c(j), then a(i,j) = for all i,j, and hence So = ESo = na (+, +) = n(b + c) 
is a.s. constant. However, we are interested instead in 

LrctJ 

S ( f ) := X^{ a o(«',7r(i)) - a (i,+)}, 
i=i 

giving S(t) = X)i=i ^^{ c ( 7r (0) — a non-trivial process with a Brownian 
bridge as natural approximation. 

We thus, throughout the paper, define the matrix a by 

a(i,j) := a (i,j)-a (i,+), (1.2) 

so that a(i, +) = 0. Correspondingly, we define 

\nt\ 

S{t) := ^a(i,7r(i)) = S (t)-ES (t). 
i=i 

We then normalize by a suitable factor s(a) > 0, and write 

Y(t) := 3(a)- 1 S(t) = s(a)- l (S (t) -ES (t)); (1.3) 
this can equivalently be expressed as 

1 - 

Y := F(tt) := — a(i, tt(z)) J i/n , (1.4) 
v ' 1=1 

where J u (t) := l[ Ui i](i). In Theorem 12.11 we approximate the random func- 
tion Y by the Gaussian process 

n 

Z := ^WiJi/n, (1.5) 

8=1 
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in which the jointly Gaussian random variables (W$, 1 < i < n) have zero 
means and covariances given by 



VaxWi = 4TE fl2 W) 
ns z (a) ^— ' 



z=i 



1 n 

Cov (W u Wj) = r Va(i,l)a(j,i) =: o^-, i ^ j. 



;i.6) 



i=i 



A simple calculation shows that Cov (a(i, it(i)), a(j, it (J))) = s 2 (a)<7ij for all 
i,j, and thus the covariance structures of the processes Y and Z are identical. 
The error in the approximation is expressed in terms of a probability metric 
defined in terms of comparison of expectations of certain smooth functionals 
of the processes, and it is bounded by a multiple of the Lyapounov ratio 

A(a) := ^^Ela^j)! 3 - ( L7 ) 

The normalization factor s(a) may be chosen in several ways. One obvious 
possibility is to choose s(a) = s(a) defined in ( II. ip . which makes VarF(l) = 
VarZ(l) = 1. At other times this is inappropriate; for example, as seen 
above, s(a) may vanish, although we have a non-trivial Brownian bridge 
asymptotic. A canonical choice of normalization is 



* 2 ( 



1 n 



n 



or, for simplicity, n~ x ^^=1 a2 (^i)> which makes no difference asymptoti- 
cally. In the special case where a(+,j) = for each j, as with the matrix a, 
this gives s 2 (a) = s 2 (a), so Var Y(l) = Var Z{\) = 1, but in general this does 
not hold. In specific applications, some other choice may be more convenient. 
We thus state our main results for an arbitrary normalization. 

In most circumstances, such an approximation by Z = Z^ depending 
on n is in itself not particularly useful; one would prefer to have some fixed, 
and if possible well-known limiting approximation. This requires making 
additional assumptions about the sequence of matrices as n — >• oo. In 
extending Bolthausen's theorem, it is enough to assume that AW(o) -> 0, 
since the approximation is already framed in terms of the standard normal 
distribution. For functional approximation, even if we had standardized to 
make VarF(l) = 1, we would still have to make some further assumptions 
about the a^ n \ in order to arrive at a limit. A natural choice would be to take 
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a {hj) '■= ot(i/n, j/n) for a continuous function a: [0, l] 2 — > M. which does 
not depend on n. We shall make a somewhat weaker assumption, enough 
to guarantee that the covariance function of converges to a limit, which 
itself determines a limiting Gaussian process. The details are given in Theo- 
rem l3.3l Note that we require that A^ n \a) log 2 n — > for process convergence, 
a slightly stronger condition than might have been expected. This is as a 
result of the method of proof, using the approach in Barbour (1990), in which 
the probability metric used for approximation is not obviously strong enough 
to metrize weak convergence in the Skorohod topology. Requiring the rate of 
convergence of AW (a) to zero to be faster than 1 / log n is however enough 
to ensure that weak convergence also takes place: see Proposition 13. II 

The motivation for proving the theorems comes from the study of per- 
mutation tableaux. In Section El we show that the boundary of a random 
permutation tableau, in the limit as its size tends to infinity, has a partic- 
ular shape, about which the random fluctuations are approximately Gaus- 
sian. The main tool in proving this is Theorem I3.3[ applied to the matrices 

do\hj) '■= 1 {i<j}- 

2 The pre-limiting approximation 

We wish to show that the distributions of the processes Y and Z of ( 11 .41) 
and (11. 5p are close. To do so, we adopt the approach in Barbour (1990). 
We let M denote the space of all twice Frechet differentiable functionals 
/ : D := D[0, 1] — >• R for which the norm 

\\f\\ M := sup{|/H|/(l+|h|| 3 )} + sup{|| J D/H||/(l+||^|| 2 )} (2.1) 
+ sup{|| J D 2 /H||/(l + H|)} + sup {\\D*f(w + h)- £>7HII/||^I|} 

weD w,heD 

is finite; here, || • || denotes the supremum norm on D, and the norm of 
a (symmetric) fc-linear form B on function in D is defined to be ||5|| := 
su PheD: \\h\\=i \B[h^)\i where denotes the fc-tuple (h, h, . . . , h). Our aim 
is to show that |Egr(y) — E,g(Z) \ is small for all g G M. We do this by Stein's 
method, observing that, for any g G M, there exists a function / G M 
satisfying 

g(w)-Eg(Z) = (Af)(w) := -Df(w)[w] + ^T a^D 2 f(w)[J i/n , J j/n ], (2.2) 
and that 

H/IIm < C \\g\\ M , (2.3) 
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where C$ does not depend on the choice of g: see, for example, Barbour 
(1990, (2.24), Remark 7 after Theorem 1 and the remark following Lemma 
3.1). Hence it is enough to prove that \E(Af)(Y)\ < e\\f\\ M for all / 6 M 
and for some small e. 

Theorem 2.1. Let Y = Y(tt) and Z be defined as in (11.41) and (11.51) . with 7r 
a uniform random permutation of {1, 2, . . . , n}, and A(a) as in ( 11.71) . for 
some n x n matrix a(i,j) with a(i,+) = and some s(a) > 0. Then there 
exists a universal constant K such that, for all f G M , 



\E(Af)(Y)\ < KA(a) l]Jm . 
Thus, for all g e M , 

\Eg(Y)-Eg(Z)\ < C KA(a)\\g\\ M , 

with Co as in (12. 3p . 

Proof. We begin by noting that 

1 n 

EDf(Y)[Y] = — -^E{X^/(y)[J v „]}, (2.4) 

K ' i=l 

where X, := a(i,7r(i)). We then write 

E{XiDf(Y)[J i/n )} = -^a(iJMDf(Y(ir))[J i/n ]\ir(t) = l}. (2.5) 

i=i 

Now realize tt' with the distribution £(tt \ Tr(i) = I) by taking 7r to be a 
uniform random permutation, and setting 

7r' = 7r, if 7r(z) = I; 

ir'(i) = I; = j- ir'{k) = ir{k), k £ {i, ir~\l)}, 

if 7r(i) = j ^ i. 

This gives 

Y(ir') = Y(7t) + A u (n) =: F'(7r), (2.6) 

where 

s(a)Au(n) := {a(i, l)-a(i, ir(i))} J i/n +{a(7r~ 1 (/), 7r(i))-a(7r _1 (/), /)} J ff -i(j)/ n , 

(2.7) 
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and Y'(tt) has the distribution C(Y(ir) \ ir(i) = I). Hence, putting (I2.6P 
into (12.51) . it follows that 

-}-E{X t Df(Y)lJ l/n ]} = -4 T y)a(i,Z)E{D/(y( 7 r)+A <l (7r))[J i/n ]}. 
s(a) ns{a) 

(2.8) 

Using Taylor's expansion, and recalling the definition ( 12. ID of || ■ \\m, we 
now have 

\E{Df(Y + A u )[J i/n }} - E{Df(Y)[J t/n ]} - E{D 2 f(Y)[J i/n , A u }}\ 

< ||/|U/E||A.,|| 2 , (2.9) 

where, from ( 12.7ft . 

||A«(7r)|| < {s(a)}- l {\a(t, l)\ + \a(i, tt(z))| + la^O, tt(z))| + K^Z), Z)|}. 

(2.10) 

Laborious calculation now shows that 

1 n n -. n n 

EEwM) | E iiA a ir < ^EEw.')i 3 = 

V ' i=l 1=1 V ' i=l i=l 

(2.11) 

for a universal constant Ci; for instance, 

j n n . 

E E l a ^' ^'^y E l a ^' 7r ( i )) a ( 7r_1 (0> ^(0)1 

^ ' i=l {=1 v ' 

< 



1 


ns 3 ( 


a) 


1 




ns 3 ( 


a) 


1 





i=l 1=1 v y j^Z fe^i 



n n 

I 

< 



-|a(i,0r + -y^EEHl°( < '0l 8 + |a(i,j)l 8 + |a(*,j)l 8 } 

i=l !=1 v 7 j^Z fe^i 



n n 



v I i=\ i=i 

Thus, in view of ( 12.8j) . when evaluating the right hand side of ( 12.41) . we have 

. n n 

EDf(Y)[Y] = -^^^a(z,Z)(E{ J D/(F)[J l/n ]}+E{ J D 2 /(^)[J l /n ! A ^})+^ ) 

(2.12) 



ns\a) 

v > i=i j=i 



where |t?i| < CiA(a)||j || A f- 
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Now, because a(i, +) = 0, the first term on the right hand side of (I2.12p 
is zero, so we have only the second to consider. We begin by writing 

D 2 f(Y)[J i/n ,A il ] = ^/WfelAJ+^/W^Ai-EAii]. (2.13) 

From (12.71) . it follows easily that 

E{D 2 f(Y)[J i/n ,EA u }} = { S (a)}- 1 a(i,0E{D 2 /(y)[4«]} ( 2 - 14 ) 



J2<r,l)E{D 2 f(Y)[J i/n ,J r/n ]}. 



(n-l)s(o) r 

Substituting this into (12.121) gives a contribution to EDf(Y)[Y] of 

1 n n 

v > i=l «=1 

n n 

" n(n-l)s*(a) E E E °( r > OE^/M [J«/ B , J P /„]} 

^ ' V ' i=l Z=l r^i 

n n 

= E ^:E{5 2 /(r)[^]} + E E ^ E { 52 /(^)[^/n, J,/„]}, (2.15) 

i=l i=l r^i 

from dH]), and thus, from (Q, fl2~T2l and (|213|) . and noting that P5| 
cancels the second term in 02.211 . 



where 



< 



|EW)Q0I < M + M, (2.16) 



1 n n 

7T E E l a ^> Z )l PEpVQO [•*/», A« - EA«]}|. (2.17) 



i=l Z=l 



It thus remains to find a bound for this last expression. 
To address this last step, we write 

E{D 2 f(Y)[J i/n ,A u -EA u ]} 

n 

= PjkE{D 2 f(Y)[J l/n ,A u -EA tl ]\7r(t)=j,7c~\l) = k}, 
j,k=i 

where pjk := P[7r(i) = j, 7r _1 (/) = k]; note that pu = 1/n, and that pjk = 
l/n(n — 1) for j ^ I, k ^ i. We then observe that, much as for (12.61) . 

Y"(n) := Y(n) + A> hjk (n) ~ C(Y(tt) | = j, = k), (2.18) 
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where, for j ^ I, k ^ i, 

s (a)A^ fc (7r) : = {Hhj) - a(i,n(i))]J i/n + [a(k,l) - a(k, n(k))] J k / n 
+ [a(7r _1 (/), ir(k)) - a(7r _1 (/), l)]J v -i(jy n 

+ {[a(i,j) - a(i,ir(i)))Ji/ n + [a(k,l) - a(k,j))J k/n 

+ [a(7T~ (Z), 7T(i)) - a(TT~ l (l), l))J^{l)/n]^-{Tr{i)^l,Tr{k)=j} 

+ {[a(i,j) - a(i,/)]J i/n + [a(k,l) - a(k,ir(k)))J k/n 

+ [a(7r _1 (j),7r(A;)) - a(7r" 1 (j), j)] J^-i (i)/ „,}l {w(i)=i} , 

(2.19) 

and 

(2.20) 

Then A iZ = Ajj(7r(z), 7r _1 (/)) is measurable with respect to er(7r(i), 7r _1 (/)), 
and 

n n 

^^p jfc E{D 2 /(^)[^/n, A« -EA«] |tt(z) = ^vr-HO = A;} 

3=1 k=l 

n n 

= ^2^2Pjk^{D 2 f(Y + A' u . jk )[J i/n , A a {j, k) - EAu]} 

3=1 k=l 
n n 

= E ^P3iM{D 2 f(Y)[J i/n , A u (j, k) - EA U }} 

3=1 k=l 

n n 

+ E ^2pjk^{D 2 f{Y + A' il;jk )[J i/n , A u (j, k) - EA a ] 

3=1 k=l 

- D 2 f{Y)[J i/n ,A a {j,k) -EAu}}. (2.21) 

Now, since Y^j=i Y^k=iPjh^u{ji k) = EA it , the first term in (12.211) is zero, by 
bilinearity. For the remainder, we have 

\\D 2 f{Y + A! a .j k )[J i / n , Au(j, k) — EAu) — D 2 f(Y)[J i/n , A a (j, k) - EAu) \\ 
< ll/llMl|A / ,, fc ||{||A,(j,A;)|| + ||EA^||}, (2.22) 

so that, from (12X71) . 

1 n n n n 

1*1 - II^^EEK^)! EE^ e H a ^II{II a ^j^)II+IIea,||}. 

v I i=l 1=1 3=1 k=l 

(2.23) 
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Here, from (12. 7p . (I2.10p and (I2.19p . each of the norms can be expressed as 
l/s(a) times a sum of elements of \a\. Another laborious calculation shows 
that indeed 

M < C 2 A(a)\\f\\ M , 
and the theorem is proved. □ 



3 A functional limit theorem 

The pre-limiting approximation is simpler than the original process, inas- 
much as it involves only jointly Gaussian random variables with prescribed 
covariances. However, if the matrix a can be naturally imbedded into a se- 
quence exhibiting some regularity as n varies, and if n is large, it may 
be advantageous to look for an n-independent limiting approximation, in the 
usual sense of weak convergence. Unfortunately, the approximation given in 
Theorem 12. II is not naturally compatible with weak convergence with respect 
to the Skorohod metric, and something extra is needed. With this in mind, 
we prove the following extension of Theorem 2 of Barbour (1990). To do so, 
we introduce the class of functionals g G M° C M for which 

IM|m° := \\g\\ M + sup \g(w)\ + sup \\Dg(w)\\ + sup \\D 2 g(w)\\ < oo. 

Proposition 3.1. Suppose that, for each n > 1, the random element Y n 
of D := D[0, 1] is piecewise constant, with intervals of constancy of length at 
least r n . Let Z n , n > 1, be random elements of D converging weakly in D to 
a random element Z of C[0, 1]. Then, if 

\Eg(Y n )-Eg(Z n )\ < Cr n \\g\\ M0 (3.1) 

for each g G M° , and if r n log 2 (l/r n ) ^0 as n — > oo, then Y n — > Z in D. 

Proof. First note that, by Skorohod's representation theorem, we may as- 
sume that the processes Z n and Z are all defined on the same probability 
space, in such a way that Z n — > Z in D a.s. as n — > oo. Since Z is continuous, 
this implies that \\Z n — Z\\ — > a.s. 

As in the proof of Barbour (1990, Theorem 2), it is enough to show that 

F[Y n G B] -> P[Z G B] (3.2) 

for all sets B of the form Hkkl^i where Bi = {w G D : \\w — si\\ < 7;} 
for si G C[0, 1], and ¥[Z G dBi] = 0. To do so, we approximate the 
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indicators I[Y n G Bj\ from above and below by functions from a family 
g := g{e,p, p,i], s} in M°, and use (13.11) . We define 

g{z,P,P,V> s }(w) := (j)p,r,(h £jP (w - a)), 

where 

KM ■= ( [\e 2 + y 2 (t))^dt) 1/P =: \\{e 2 + y 2 ) l ' 2 \\ p , 



and cj) p ^{x) := (p((x — p)/r)), for 0: R + — > [0, 1] non- increasing, three times 
continuously different iable, and such that 4>(x) = 1 for x < and 4>(x) = for 
x > 1. Note that each such function g is in M°, and that H^Ha/ < C'p 2 e~ 2 ri~ 3 
for a constant C" not depending on e,p, p, r], s, and that the same is true for 
finite products of such functions, if the largest of the p's and the smallest of 
the e's and 77's is used in the norm bound. 
Now, if x G B[, it follows that gi(x) = 1, for 

9i ■= 0{e7i,P,7i( 1 + e2 ) 1/2 > 7 Mj}> 
for all s,p,T]. Hence, for all £,p,r), 

L L 

P ~ 

KKI ~ "i=l "i=l 



2„-3 



K„G f) £,] < E{j]^(F n )} < E{j]^(Z n )}+Cr n C> 2 (£ 7 )^ 

(3.3) 



where 7 := mini</<^7;. Then, by Minkowski's inequality, 

h e , p {Z - 81) < h £iP (Z n - ai) + \\Z n - Z\\ p < h £ , p (Z n - si) + \\Z n - Z\\. 
Hence, if p n — > 00 as n —>■ 00 and e is fixed, 

\immfh £ , Pn (Z n - Sl ) > \imM{h £jPn (Z- Sl )-\\Z n -Z\\} = ||( £ 2 +|Z- Si | 2 ) 1/2 || a.s. 

n^oo ri— >oo 

It thus follows that, if \\Z — sj|| > 7;, and if r] n — > as n — > 00, then 
liminf^^^-s,)-^} > ||(£ 2 7 f + |^-s ; | 2 ) 1/2 || > 7 K1+£ 2 ) 1/2 a.s., 

n— >oo 

and so gi n (Z n ) = for all n sufficiently large, where 

9ln ■= g{£Jl,Pn,Jl(' i - + £ 2 Y /2 ,Vn,Si}. 

Applying Fatou's lemma to 1 — Yli=i 9in(Z n ) , an d because F[Z G dB{\ = 
for each I, we then have, 

L L 

limsupE-j JJ^„(Z„) I < Eilimsup JJ^„(Z„) \ 

1=1 

L 

< E(jjl{||^-*i||<7i}) = nZeB}. 



i=i 



8=1 
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Thus, letting p n — ► oo and r\ n — » in such a way that T n p^ri~ 3 — > 0, it follows 
from (13.31) that limsup^^ F\Y n G B] < ¥[Z G B], and we have proved one 
direction of ( 13.21) . 

For the other direction, fix 9 > small, and let 5 > be such that, if 
\\Y n - si\\ > 7/, then 

leb{t: |y„(t)-a,|>7j(l-0)}> (<*A§r„). (3.4) 

Such a 5 exists, because the collection (si, 1 < I < L) is uniformly equicon- 
tinuous, and because the functions Y n are piecewise constant on intervals of 
length at least r n . Hence, for such Y n , 

h £luP {Y n - Sl ) > 7i{e a + (l-e) 2 } 1/2 (*Air B ) 1/p , 
and thus gi{Y n ) = 0, where, for any p and rj, 

9l ■= g{e lh p, ll (e 2 + (l-9) 2 ) 1 / 2 (SA 1 1 r n ) 1/p -r ] ,r ] ,s l }. 
Thus, for any p and h, I[Y n G > g*(Y n ), and hence 

L L 

pke n s '] ^ E {ru*( y «)} ^ E{n^(^)}-^„c> 2 (e 7 )-v 3 - 



KKL 



i=l 



i=l 



(3.5) 

Now suppose that ||Z — sj|| < 7/(1 — 9). Then there exists an a > such 
that a.s. ||Z n — < 7^(1 — 9) — a for all n sufficiently large. This in turn 
implies that 

h eiltPn (Z n - Sl ) < {s 2 1 f+\\Z n -s l \\ 2 } 1 / 2 < ll {s 2 + (l-9-a 1 r 1 ) 2 } 1/2 
< li{e 2 + {l-9fYl 2 {5A\r n ) 1/p --r ln 

for all n large enough, if r\ n — * and p n — > oo in such a way that r^' 1 — > 1. 
This in turn implies that gl n {Z n ) = 1 for all n large enough, where 

9l := ^{e^^TK^ + a-^^^AirJ 1 ^-^^^^}. (3.6) 
Hence 



E{liminfn^ n (Z n )} > P[ D {||Z-^||< 7 K1-^)}" 

i=l Kl<i 

Applying Fatou's lemma, and recalling (13. 5p . we now have a.s. 

L L 



(3.7) 



lim inf P 

n— >oo 



V n G f| £,] > UminfE{lJ^(Z n )} > e{ lim inf Y[gl(Z n )}, 

(3.8) 



KKL 



i=l 



i=l 
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provided that also T n p^r]~ 3 — ► 0: this can be arranged by judicious choice of 
p n — > oo and r\ n — * if, as assumed, r n log 2 (l/r n ) — > 0. Hence, since 9 was 
chosen arbitrarily, it follows from (13.71) and ( 13. 8ft that 

liminfP[F n G B] > P[Z G B], 

n— >oc 

and the theorem is proved. □ 



Note that, in Barbour (1990, Theorem 2), restricting to functions g satis- 
fying (2.32) of that paper is not permissible: the bound (13. ip is needed for 
functions in Mq that do not necessarily satisfy (2.32). 

Remark 3.2. The assumption that Y n is piecewise constant can be relaxed 
to Y n being piecewise linear, with intervals of linearity of length at least r n ; 
in particular, this allows processes Y n obtained by linear interpolation. The 
only difference in the proof is that, if \\Y n — si\\ > ji, then |F n (t ) — si(t )\ > 
(1 — 9/ '4)7 1 for some to- Thus, by the assumption on Y n and the continuity 
of Si, there exists an interval Iq of length at least l n : = |r n A 5, with to a s 
an endpoint, on which Y n is linear and |s;(t) — (to) I < 9^/4. A simple 
geometrical argument now shows that \Y n {t) — si(t )\ > (1 — 9/2)ji in a 
subinterval of length at least 9l n /8, at one or other end of I Q . Hence, (13 .4p 
can be replaced by 

leb{t: \Y n (t)- Sl \ > 7/(1-0)} > |(<5Af n ), 

and the rest of the proof is the same. 

We now turn to proving a functional limit theorem for the sums derived 
from a sequence of matrices cS n \ n > 1. Supposing that s^ n \a) > 0, we 
define functions 

, l nt \ n 

^ (i ) : =^w^^ (a< " l(M))2; 

l ~\ l ~\ (3.9) 

v ^ 11 i=i j=i 1=1 

for < t,u < 1. Note that if we choose s {n) (a) by ([EH]), then / n (l) = 
(n — l)/n — > 1. Conversely, if /n(l) converges to a limit c > 0, then s ( - n \a) 
differs from the value in ( 11.81) only by a factor c -1 / 2 + o(l). 
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Theorem 3.3. Suppose that f n —>f and g n g pointwise, with f continu- 
ous, and that A^ n \a) log 2 n — > 0. Then there exists a zero mean continuous 
Gaussian process Z on [0, 1] with covariance function given by 

Cov(Z(t),Z(u)) = a{t,u) := f(t A u) - g{t,u), (3.10) 

and Y n —* Z in D[0, 1]. 

Proof. Fix n > 2. We begin by realizing the random variables as func- 
tions of a collection (Xn, i, I > 1) of independent standard normal random 
variables. Writing Xi := n~ x Yl^i^-u, we se ^ 

n 

wf := , n)( ' = oW(*,0(^-^); ^ W := £w£°- ( 3 - n ) 

Direct calculation shows that, with 5jj the Kronecker delta, 

n 

Cov(W$ n) ,w} n) ) = J2 Gov ( W u\w^) (3.12) 

i=i 

n 

= E ( „- 1 )(. W ( B))3 ° w ('-')° M aow i ,-~-'). 

in accordance with (11.61) . so we can set 

n 

:= ^^ (n) J j/n . (3.13) 



i=i 



Now Theorem O shows that \E{g(Y n ) - g(Z n )}\ < CA (n) (a) ||#|| M o for any 
g G M°; furthermore, the process Y n is piecewise constant on intervals of 
lengths 1/n, and, by assumption, A^ n ^(a)log 2 n — ► 0. Hence, in order to 
apply Proposition 13.11 it is enough to show that Z n — > Z for a continuous 
Gaussian process. 

Write Z n = Zn 1 " 1 — Z^ 2 " 1 , where 

Z ( 2 )(t) : = - - -L yy fl (")(i,i)i,, 

The process Zn 1 "* is a Gaussian process with independent increments, and 
can be realized as W(f n (-)), where W is a standard Brownian motion and 
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f n (t) := nf n (t)/(n — 1). Now / is continuous, by assumption, and each /„ 
is non-decreasing, so f n — > / uniformly on [0,1], and hence W(f n (-)) — ► 
W(/(0) in £>[0, 1]. Since the latter process is continuous, it follows that the 
sequence Zn is C-tight in D[0, 1]. 

To show that Z„ is also C-tight, we use criteria from Billingsley (1968). 
For < t < u < 1, it follows from (13.141) and Holder's inequality that 

1 n L nu J 2 1 



i=[ntj+l 

n L nu J 



- n (n-l)( a W(a))^ L JJ tr^ +1 l 

< /.(i) LH- Ln*J . 

n — 1 



Hence, since Z^ is Gaussian, we have 



E\ZW(u)-ZW(t)\* = mZ^\u)-Z^(t)\ 2 f < ^{Ul ^l^ ) 2 . 

(3.15) 

Thus, if0<t<f<-u<l and u — t > 1/n, it follows that 

E{|Z?)(t;)-Z«(0| 2 |Z«(«)-ZJ?)(t;)| 2 } 



< ^E|2< 2) (u) - 4 2) (*)| 4 E|4 2) (u) - 4 2) («)| 4 

< 3/ n * ( i)(M^M M^Mn < 12/ a (1)(w _ t)2 . (3 . 16) 

V n — 1 n — 1 / 

the inequality is immediate for u — t < 1/n, since then [rif J e {|_^J ; L™J}- 
Now, for any < t < u < 1, we have 

Cov(Zi 2 )(t),Z( 2 )( M )) = - ^,u). 

n — 1 

Hence there exists a zero mean Gaussian process Z^ with covariance func- 

(2) 

tion g, and the finite dimensional distributions of Z\ converge to those 
of Z®. By (jHISD and Fatou's lemma, E\Z^ 2 \u) - Z® (t)\ 4 < 3/ 2 (1)(m - tf 
for any < t < u < 1, so that, from Billingsley (1968, Theorem 12.4), we 
may assume that Z^ 2 ) G C[0, 1]. From (l3~T6l) and Billingsley (1968, Theo- 
rem 15.6), it now follows that zP -> Z( 2 ) in D[0, 1]. Thus Z^ 2) is C-tight 
also. 
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Now, since both {Z { n L> } and {Zk >} are C-tight, so is their difference {Z n }. 
From KSSJ) and 1ETI2]) . for t,u E [0, 1], 

Cov(Z n (t),Z n (u)) = -f n (tAu) -g n (t,u) -> f(tAu)-g(t,u), 

n — 1 n — 1 

so that the finite dimensional distributions of Z n converge to those of a ran- 
dom element Z of C[0, 1] with covariance function cr(£, u), as required. □ 



4 Rate of convergence 

Under more stringent assumptions, the approximation of Z n by Z can be 
made sharper. To start with, note that it follows from the representation 
(13. lip and (13.131) that Z n can be written as a two dimensional stochastic 
integral 

ii f 

Zn ^ = ~Tnu~~n = ^ / a n {v,w)K{dv,dw) (4.1) 
sW(a)Vn - 1 Ji n ( t )xi 

with respect to a Kiefer process .fT, where I n {t) := [0, n -1 |_^J], I '■= [0, 1] and 
a n (v, w) := a < - n ' ) ( [nt> ] , [nit?]). Recall that the Kiefer process K has covariance 
function Cov (K(vi, Wx), K(v 2 , W2)) = (fi A v 2 — ^i^X^i A u> 2 ) and can be 
represented in the form K(v,w) = W(v,w) — vW(l,w), where W is the 
two-dimensional Brownian sheet (Shorack Sz Wellner 1986, (5) p. 30 and 
Exercise 12, p. 32). Thus K is like a Brownian bridge in v, and a Brownian 
motion in to. 

In this section, we let s(a^) be given by (|1.8p . Hence if, for example, the 
functions a n converge in L 2 to a square integrable limit a (not a.e. 0), then, 

-{s^ n \a)} 2 = \\at n \\l — > a 2 := [ dv [ dwa 2 (v,w) = \\a"° 



2 1° l"7J _ ll"n||2 ^ u a ■— 1 uju I uuj ui ^u,ujj — ||ix|| 2 , 

n Jo Jo 

and the limiting process Z can be represented as 

Z(t) = a' 1 / a(y,w)K(dv,dw), (4.2) 

i[0,t]x/ 

enabling a direct comparison between Z n and Z to be made. Since a n — >l 2 a ; 
it follows that 



(4.3) 



fn(t) -> f(t) := cr a 2 dv dwa 2 (v,w); 

Jo Jo 

rt ru pi 

g n (t,u) — > g(t,u) := a~ 2 dv dx dw a(v, w)a(x, w), 
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with / continuous, as required for Theorem 13.31 and that Z has covariance 
function a(t, u) as defined in (13.101) . For the following lemma, we work under 



silghtly stronger assumptions. 

Lemma 4.1. Suppose that a n — > a in L 2 , where a is bounded and not a.e. 0, 

and that, for some < (3 < 2, 

\g(t,t) + g(u,u)-2g(t,u)\ < C 2 g \u-tf, < t < u < 1. (4.4) 

Define a + : = ||a||oo/IM|2 < oo and e n (v,w) := 1 {« n (f, w) — a(v,w)}. 
Then, for any r > 0, there is a constant c(r) such that 

F[su V \Z n (t)-Z(t)\ > C (r){||£„|| 2 + (a + + C>-^ A1 )/ 2 }v^oi^] < n~ r , 
tel 

where Z is as defined in (14. 2p . 

Proof. Define e n (v,w) := ■j^^ Tf a n (v,w) - cr~ 1 a(t;, w). We start by con- 
sidering t of the form i/n, 1 < i < n, so that 



Z n (t) — Z{t) = I e n (v,w) K(dv, dw). 

J[0,t]xl 

From this and the representation K(v,w) = W(v,w) — vW(l,w), it fol- 
lows that maxtg/ K{Z n (t) — Z(t)} 2 < ||e n |||, and hence, from the Borell-TIS 
maximal inequality for Gaussian processes (Adler and Taylor 2007, Theo- 
rem 2.1.1), we have 



P 



max 

ten- 1 {1,2,.. .,n) 



e n (v,w) K(dv,dw) > Ci(r)\\e n \\ 2 \fiogn 

[0,t]xl 



if Ci(r) is chosen large enough. However, 

ot n a 



&n II i| i| || j 

||a„,||2 ||a|| 2 

from which it follows that 

ll^nlh < 2||e n ||2. 

It thus remains to consider the differences Z n (t)—Z(t) for t not of the form 
i/n. Between n~ 1 \nt\ and t, the process Z n remains constant, whereas Z 
changes; hence it is enough to control the maximal fluctuation of Z over 
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intervals of the form [(i — l)/n, i/n], 1 < i < n. Here, we use the Fernique- 
Marcus maximal inequality for Gaussian processes (Leadbetter et al. 1983, 
Lemma 12.2.1), together with the inequality 

\a(u,u) +a(t,t) - 2a(t,u)\ < C 2 g \t - uf + (a + ) 2 \t - u\, 

to give the bound 



P 



max sup 

l<i<n (i-l)/ n < v <i/ n 



\Z{v)-Z((i-l)/n)\ > c 2 (r)(C , 9 +a + )n- (/3A1)/2 v / log^ 



< 



if C2(r) is chosen large enough, and the proof is now complete. □ 



Note that, under the conditions of Lemma |4~TI the requirements for The- 
orem Elare fulfilled, provided that A^^a) -> fast enough. This is true 
if also, for instance, for some c < oo, Halloo < c||a||oo for all n, since then 
A(")(a) < 2ca + n~ 1 / 2 for all n large enough. Combining Theorems 12.11 
and 13.31 with Lemma 14.11 then easily gives the following conclusions. 

Theorem 4.2. Under the conditions of Lemma \4-1\ and if also ||a n ||oo/||«||oo 
is bounded, then Y n ^ d Z in D[0, 1], for Z as defined in (14.21) . and, for any 
functional g G Mq, 



\Eg(Y n )-Eg(Z)\ < C {A^ (a)+n- 1 +{\\e n \\ 2 +(a + +C g )n-^/ 2 } ||^| 

(4.5) 

for some constant C . 
Proof. We note that 

\Eg(Y n )-Eg(Z)\ < \Eg(Y n ) - Eg(Z n )\ + E\g(Z n ) - g{Z)\. 

The first term is bounded using Theorem 12.11 whereas, for any a > 0, 

E\g(Z n ) - g{Z)\ < 2 sup |<?H|P[||Z n - Z^ > a) + a sup \\Dg(w)\\ 

< \\g\\ M0 {2F[\\Z n -Z\\ oo >a]+a}, 

and the theorem follows by taking a = c(l){ ||e n || 2 +(a + +(7 9 )n~^ A1 ^ 2 } yTogra 
and applying Lemma [4. II with r = 1. □ 
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5 The shape of permutation tableaux 



We begin by studying the number of weak exceedances in a uniform random 
permutation 7T on {1, 2, . . . , n}; we shall suppress the index n where possible. 
The number of weak exceedances is defined to be the sum Y17=i^^ where 
Ii := l{7r(j)>«}- The process So(t) := Yl\=i ^ 1S thus °f the kind studied 
in the introduction, with a (i,j) := l{«<j}. Simple calculations show that 
Eij = a (i,+) = (n — i + l)/n, and thus 

a{ hJ ) = a<- n \i,j) = l {i < i} - 1 + (i - l)/n, (5.1) 
E5 (A;/n) = ^ — (5.2) 

Hence, as n — > oo, 

ES (t) = nt(l-t/2) + 0(l). (5.3) 
Further, although we will not need it, for i < j, 

^ r , n — i ^ r . (n — i)(n— 7 + 1) 

E/i^l = 7, EUJ,} = ^ J - i, 

n — 1 (n — 1 )n 



which makes it possible to calculate variances and covariances exactly. Higher 
moments can be computed exactly, too. 

We now turn to the approximation of S(t) := So(t) — ES'o(t). We first 
note that 

- oc(i/nJ/n)\ < n -1 , 

where a(t, u) := l{t< u } — 1+t, so that \a n (t, u) — a(t, u)\ < 2n~ 1 for \t — u\ > 
n , and that \a n (t,u) — a(t,u)\ < 1 for all t,u 6 J. Thus a„ -> a in L2, 
with 

= i/6 ; || £n ||2 < 18/n; a + = v 7 ^, 

and || a n || 00/ 1| « || 00 is bounded. Calculation based on (14. 3 p shows also that, 
for < t < u < 1, 

fit) = 6 f x(l - x) dx = 3t 2 - 2t 3 ; 
Jo 

r-t r-u 

g(t,u) = 6/ / {(l-xVy)-(l-x)(l-y)}dxdy 
Jo Jo 

= 3t 2 u - t 3 - \t 2 u 2 , 
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and that we can take (3 = 2 in (I4.4p . Hence we can apply Theorem 14.21 
and defining Y n by (II. 3p with fll.Sp . conclude that Y n — ► Z in D[0, 1], with 
convergence rate 0{rr x l 2 Vlog n) as measured by Mo-functionals, where Z 
is the Gaussian process given by (14.21) : 

Z(t) — Vd / {l{i,< w } - 1 + f } K(dv , dw). 

J[0,t]xl 

Note also that 

Y n (t) = y/6/^{S {t) - nt(l -t/2)} + Ofa" 1 / 2 ), (5.4) 

indicating that the approximation can be simplified, as in the following the- 
orem. 

Theorem 5.1. Let S ( n) (t) := Ej=i 4"^ where 4^ : = 1{^>(;)>*} and 7r< n > 
is a uniform random permutation on {1, 2, . . . , n}. Write fi(t) := t(l — t/2). 
Then 

Y n := n 1 ' 2 {St ] - nfi} ^ d Z in D[0,1], 

where Z is a zero mean Gaussian process with covariance function a given 
by 

a(t,u) = la(t,u) = l{f{t)-g{t,u)) = \t 2 {l-u+\u 2 )-\t\ < t < u < 1. 

The number of weak exceedances of a permutation is one of a number 
of statistics that can be deduced from the permutation tableaux introduced 
by Steingrimsson and Williams (2007). Such a tableau is a Ferrers diagram 
(possibly with some rows of length 0) with elements from the set {0, 1} 
assigned to the cells, with the following restrictions: 

1. Each column of the rectangle contains at least one 1; 

2. There is no that has a 1 above it in the same column and a 1 to its 
left in the same row. 

The length of a tableau is defined to be the sum of the numbers of its rows 
and columns, and the set of possible tableaux of length n is in one-to-one 
correspondence with the permutations of n objects. In particular, under the 
bijection between tableaux and permutations defined by Steingrimsson and 
Williams (2007, Lemma 5), the lower right boundary, which consists of a 
sequence of n unit steps down or to the left, has its i-th step down if 1^ = 1 
and to the left if 1^ = 0. Hence the Theorem 15.11 above . together with (15.31) . 
provides information about the asymptotic shape of the lower right bound- 
ary T n of the tableau corresponding to a randomly chosen permutation. Let 
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the upper left corner of the Ferrers diagram represent the origin with the x- 
axis to the right and the |/-axis vertically downward, so that the lower right 
boundary runs from (n — So(l),0) to (0,So(l)): then T n consists of the set 
{(n — Sq(1) — l+So(l), Sq(1)), < I < n}, linearly interpolated. Hence, n _1 T n 
is approximated within 0(n _1 ) by the curve 

{ (i[l_ e\ + n- 1 ' 2 (Y n (t) - Y n (l)), |[1 - (1 - t) 2 ] + n- 1 ' 2 Y n (t)), < t < 1}, 
where Y n is as defined in Theorem 15.11 

Corollary 5.2. As n — > oo, n -1 !^ can fre approximated in distribution by 
{(![!_ t 2] + „-i/a ( ^ (t) _ ^ n(1)); I[i _ (i _ t) 2 ] + n -i/a 
< t < 1}, 

an error o(n -1 / 2 ). 

In particular, as can also be seen more directly, ra~ 1 r„ converges in prob- 
ability to the deterministic curve 

{(i[l-t 2 ], §[l-(l-t) 2 ], < t < 1} = {(x,y) G [0,oo) 2 : x+y = \-{x-y) 2 }, 

an arc of a parabola. 

Another statistic of interest is the area A n of such a tableau, which is 
given by the formula A n := Yui=i X}j=i+i(l — ^j)> again because of the 
bijection above. Direct computation yields the expression 

n 

A n = ^5 (z/n)-l5 2 (l)-i5 (l) 

i=l 
n 

= - i/2n) + yfaY n {i/n)} - \{{n/2) + v^?n(l)} 2 

i=l 

-i{(n/2) + v ^r n (i)} 

= h -^ + n^ 2 {n-^Y,S/n)-\Y n {l)} 

i=l 

-\{Vn~Y n (l) + nY n (l) 2 }. 

This leads to the following limiting approximation. 
Corollary 5.3. As n — > oo, 
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Proof. By the continuous mapping theorem and Slutsky's lemma, it is im- 
mediate from Theorem 15.11 that 

n-' i/2 {A n -^) ^ d fj{t)dt-\Z{l). 

Now the random variable {J^ Z(t) dt — |Z(1)} has mean zero and variance 

/ / a(t,u)dudt- f a(t,l)dt+\a{l,l), 
Jo Jo Jo 

with a as in Theorem 15.11 and this gives the value 1/144. The corollary 
follows. □ 



Note also that the number of rows in the permutation tableau R n = 5*0(1); 
hence Theorem 15.11 implies also, using <r(l, 1) = 1/12, 

n- 1 ' 2 (R n - \n) ^ d JV(0,i). 

This, however, does not require the functional limit theorem; it follows by the 
arguments above from Hoeff ding's (1951) combinatorial central limit theo- 
rem, and it can also be shown in other ways, see Hitczenko and Janson 
(2009+). 
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